D$^2$: Decentralized Training over Decentralized Data

نویسندگان

  • Hanlin Tang
  • Xiangru Lian
  • Ming Yan
  • Ce Zhang
  • Ji Liu
چکیده

While training a machine learning model using multiple workers, each of which collects data from their own data sources, it would be most useful when the data collected from different workers can be unique and different. Ironically, recent analysis of decentralized parallel stochastic gradient descent (D-PSGD) relies on the assumption that the data hosted on different workers are not too different. In this paper, we ask the question: Can we design a decentralized parallel stochastic gradient descent algorithm that is less sensitive to the data variance across workers? In this paper, we present D2, a novel decentralized parallel stochastic gradient descent algorithm designed for large data variance among workers (imprecisely, “decentralized” data). The core of D2 is a variance blackuction extension of the standard D-PSGD algorithm, which improves the convergence rate from O ( σ √ nT + (nζ 2) 1 3 T2/3 ) to O ( σ √ nT ) where ζ2 denotes the variance among data on different workers. As a result, D2 is robust to data variance among workers. We empirically evaluated D2 on image classification tasks where each worker has access to only the data of a limited set of labels, and find that D2 significantly outperforms D-PSGD.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Economic Droop Scheme for Decentralized Power Management in DC Microgrids

This paper proposes an autonomous and economic droop control scheme for DC microgrid application. In this method, a cost-effective power sharing technique among various types of DG units is properly adopted. The droop settings are determined based on an algorithm to individually manage the power management without any complicated optimization methods commonly applied in the centralized control ...

متن کامل

A MULTI-OBJECTIVE DECENTRALIZED MULTIPLE CONSTRUCTION PROJECTS SCHEDULING PROBLEM CONSIDERING PERIODIC SERVICES AND ORDERING POLICIES

In decentralized construction projects, costs are mostly related to investment, material, holding, logistics, and other minor costs for implementation. For this reason, simultaneous planning of these items and appropriate scheduling of activities can significantly reduce the total costs of the project undertaken. This paper investigates the decentralized multiple construction projects schedulin...

متن کامل

Performance Evaluation of Supply Chain under Decentralized Organization Mechanism

Abstract Nowadays among many evaluation methods, data envelopment analysis has widely used to evaluate the relative performance of a set of Decision Making Units (DMUs). Data Envelopment Analysis (DEA(is a mathematical tool for evaluating the relative efficiency of a set Decision Making Units (DMUs), with multiple inputs and outputs. Traditional DEA models treat with each DMU as a “black box" t...

متن کامل

The Expected Achievable Distortion of Two-User Decentralized Interference Channels

This paper concerns the transmission of two independent Gaussian sources over a two-user decentralized interference channel, assuming that the transmitters are unaware of the instantaneous CSIs. The availability of the channel state information at receivers (CSIR) is considered in two scenarios of perfect and imperfect CSIR. In the imperfect CSIR case, we consider a more practical assumption of...

متن کامل

Decentralized prognosis of fuzzy discrete-event systems

This paper gives a decentralized approach to the problem of failure prognosis in the framework of fuzzy discrete event systems (FDES). A notion of co-predictability is formalized for decentralized prognosis of FDESs, where several local agents with fuzzy observability rather than crisp observability are used in the prognosis task. An FDES is said to be co-predictable if each faulty event can be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018